Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 119390 |
| Missing cells | 492 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 21.0 MiB |
| Average record size in memory | 184.0 B |
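The summary figures above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the missing-cell percentage is taken over all cells (observations × variables) and that 1 MiB is 1024 × 1024 bytes.

```python
def missing_cell_share(missing_cells, n_rows, n_cols):
    """Share of missing cells as a percentage of all cells in the table."""
    return 100.0 * missing_cells / (n_rows * n_cols)


def average_record_size(total_mib, n_rows):
    """Average bytes per row, assuming 1 MiB = 1024 * 1024 bytes."""
    return total_mib * 1024 * 1024 / n_rows


# Figures copied from the table above.
share = missing_cell_share(492, 119390, 23)
record = average_record_size(21.0, 119390)
print(f"missing cells: {share:.4f}%")     # well below the 0.1% shown
print(f"average record: {record:.1f} B")  # close to the reported 184.0 B
```

Because the total size is itself rounded to one decimal, the recomputed record size only approximately matches the reported value.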
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 11 |
Warnings
| Warning | Type |
|---|---|
| country has a high cardinality: 177 distinct values | High cardinality |
| repeatFlag is highly correlated with historicBookings | High correlation |
| historicBookings is highly correlated with repeatFlag | High correlation |
| roomType is highly correlated with assignedType | High correlation |
| assignedType is highly correlated with roomType | High correlation |
| Unnamed: 0 is highly correlated with type and 2 other fields | High correlation |
| type is highly correlated with Unnamed: 0 and 1 other field | High correlation |
| canceledFlag is highly correlated with Unnamed: 0 | High correlation |
| arrivalMonth is highly correlated with arrivalWeek | High correlation |
| arrivalWeek is highly correlated with Unnamed: 0 and 1 other field | High correlation |
| numberWeekendnights is highly correlated with numberNights and 1 other field | High correlation |
| numberNights is highly correlated with numberWeekendnights | High correlation |
| chidren is highly correlated with roomType | High correlation |
| segment is highly correlated with deposit | High correlation |
| historicCancellations is highly correlated with historicBookings | High correlation |
| historicBookings is highly correlated with historicCancellations | High correlation |
| roomType is highly correlated with chidren and 1 other field | High correlation |
| assignedType is highly correlated with type and 1 other field | High correlation |
| changesFlag is highly correlated with numberWeekendnights | High correlation |
| deposit is highly correlated with segment | High correlation |
| historicCancellations is highly skewed (γ1 = 24.45804872) | Skewed |
| historicBookings is highly skewed (γ1 = 23.53979995) | Skewed |
| Unnamed: 0 is uniformly distributed | Uniform |
| Unnamed: 0 has unique values | Unique |
| time2Checkin has 6345 (5.3%) zeros | Zeros |
| numberWeekendnights has 51998 (43.6%) zeros | Zeros |
| numberNights has 7645 (6.4%) zeros | Zeros |
| historicCancellations has 112906 (94.6%) zeros | Zeros |
| historicBookings has 115770 (97.0%) zeros | Zeros |
| changesFlag has 101314 (84.9%) zeros | Zeros |
| waitingDays has 115692 (96.9%) zeros | Zeros |
| numberofRequests has 70318 (58.9%) zeros | Zeros |
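The "Zeros" and "Skewed" entries above rest on two simple quantities: the share of rows equal to zero, and the skewness coefficient γ1. The following is a minimal sketch with hypothetical helper names. It uses the plain population formula g1 = m3 / m2^1.5; the report does not state which estimator it uses, and libraries such as pandas apply a small-sample bias correction, so results on small samples may differ slightly.

```python
from typing import Sequence


def zero_share(zero_count: int, n_rows: int) -> float:
    """Percentage of rows whose value is exactly zero."""
    return 100.0 * zero_count / n_rows


def skewness(values: Sequence[float]) -> float:
    """Population skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Zero count for time2Checkin, copied from the list above.
print(round(zero_share(6345, 119390), 1))

# A mostly-zero toy sample is right-skewed, like the historic* columns.
print(skewness([0, 0, 0, 0, 1]))
```

A column that is almost entirely zero with a few large values, such as historicBookings, produces a large positive γ1 for the same reason the toy sample does.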
Reproduction
| Analysis started | 2021-12-28 05:04:42.949296 |
|---|---|
| Analysis finished | 2021-12-28 05:05:03.982468 |
| Duration | 21.03 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
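The reported duration follows directly from the two timestamps. A small sketch using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction table above.
started = datetime.strptime("2021-12-28 05:04:42.949296", FMT)
finished = datetime.strptime("2021-12-28 05:05:03.982468", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```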
Unnamed: 0
Real number (ℝ≥0)
| Distinct | 119390 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59694.5 |
| Minimum | 0 |
| Maximum | 119389 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5969.45 |
| Q1 | 29847.25 |
| median | 59694.5 |
| Q3 | 89541.75 |
| 95-th percentile | 113419.55 |
| Maximum | 119389 |
| Range | 119389 |
| Interquartile range (IQR) | 59694.5 |
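The fractional percentiles above are what linear interpolation between neighbouring order statistics gives for a column holding the row index 0 … 119389, as the minimum, maximum and range indicate. The sketch below assumes that interpolation rule (the default in NumPy and pandas); `quantile` is a hypothetical helper, not the report's own code.

```python
from math import floor


def quantile(sorted_values, q):
    """q-th quantile (0 <= q <= 1) by linear interpolation between order statistics."""
    position = q * (len(sorted_values) - 1)
    lower = floor(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


index = list(range(119390))  # the values 0 .. 119389
q1, q3 = quantile(index, 0.25), quantile(index, 0.75)
print(quantile(index, 0.05), q1, q3, q3 - q1)
```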
Descriptive statistics
| Standard deviation | 34465.06866 |
|---|---|
| Coefficient of variation (CV) | 0.577357523 |
| Kurtosis | -1.2 |
| Mean | 59694.5 |
| Median Absolute Deviation (MAD) | 29847.5 |
| Skewness | 0 |
| Sum | 7126926355 |
| Variance | 1187840958 |
| Monotonicity | Strictly increasing |
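For a strictly increasing run of integers 0 … n − 1, these statistics have closed forms, which makes the table easy to verify. The sketch below assumes the variance is the sample variance (divisor n − 1); with that assumption the results agree with the reported figures up to rounding.

```python
from math import sqrt

n = 119390                    # number of observations

total = n * (n - 1) // 2      # 0 + 1 + ... + (n - 1)
mean = (n - 1) / 2
variance = n * (n + 1) / 12   # sample variance of 0 .. n-1 (divisor n - 1)
std = sqrt(variance)
cv = std / mean               # coefficient of variation

print(total, mean, variance, std, cv)
```

For large n the coefficient of variation approaches 1/√3 ≈ 0.577, and the excess kurtosis of a uniform distribution is −1.2, which is consistent with the "uniformly distributed" flag on this column.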
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 62155 | 1 | < 0.1% |
| 4823 | 1 | < 0.1% |
| 6870 | 1 | < 0.1% |
| 725 | 1 | < 0.1% |
| 2772 | 1 | < 0.1% |
| 13011 | 1 | < 0.1% |
| 15058 | 1 | < 0.1% |
| 8913 | 1 | < 0.1% |
| 10960 | 1 | < 0.1% |
| Other values (119380) | 119380 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 119389 | 1 | |
| 119388 | 1 | |
| 119387 | 1 | |
| 119386 | 1 | |
| 119385 | 1 | |
| 119384 | 1 | |
| 119383 | 1 | |
| 119382 | 1 | |
| 119381 | 1 | |
| 119380 | 1 |
type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | R |
|---|---|
| 2nd row | R |
| 3rd row | R |
| 4th row | R |
| 5th row | R |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| C | 79330 | 66.4% |
| R | 40060 | 33.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| c | 79330 | |
| r | 40060 |
canceledFlag
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 75166 | 63.0% |
| 1 | 44224 | 37.0% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 75166 | |
| 1 | 44224 |
time2Checkin
Real number (ℝ≥0)
| Distinct | 479 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 104.0114164 |
| Minimum | 0 |
| Maximum | 737 |
| Zeros | 6345 |
| Zeros (%) | 5.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 18 |
| median | 69 |
| Q3 | 160 |
| 95-th percentile | 320 |
| Maximum | 737 |
| Range | 737 |
| Interquartile range (IQR) | 142 |
Descriptive statistics
| Standard deviation | 106.863097 |
|---|---|
| Coefficient of variation (CV) | 1.027416997 |
| Kurtosis | 1.696448849 |
| Mean | 104.0114164 |
| Median Absolute Deviation (MAD) | 60 |
| Skewness | 1.346549873 |
| Sum | 12417923 |
| Variance | 11419.72151 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6345 | 5.3% |
| 1 | 3460 | 2.9% |
| 2 | 2069 | 1.7% |
| 3 | 1816 | 1.5% |
| 4 | 1715 | 1.4% |
| 5 | 1565 | 1.3% |
| 6 | 1445 | 1.2% |
| 7 | 1331 | 1.1% |
| 8 | 1138 | 1.0% |
| 12 | 1079 | 0.9% |
| Other values (469) | 97427 | 81.6% |
| Value | Count | Frequency (%) |
| 0 | 6345 | 5.3% |
| 1 | 3460 | 2.9% |
| 2 | 2069 | 1.7% |
| 3 | 1816 | 1.5% |
| 4 | 1715 | 1.4% |
| 5 | 1565 | 1.3% |
| 6 | 1445 | 1.2% |
| 7 | 1331 | 1.1% |
| 8 | 1138 | 1.0% |
| 9 | 992 | 0.8% |
| Value | Count | Frequency (%) |
| 737 | 1 | < 0.1% |
| 709 | 1 | < 0.1% |
| 629 | 17 | |
| 626 | 30 | |
| 622 | 17 | |
| 615 | 17 | |
| 608 | 17 | |
| 605 | 30 | |
| 601 | 17 | |
| 594 | 17 |
arrivalMonth
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.903182846 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | July |
|---|---|
| 2nd row | July |
| 3rd row | July |
| 4th row | July |
| 5th row | July |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| August | 13877 | 11.6% |
| July | 12661 | 10.6% |
| May | 11791 | 9.9% |
| October | 11160 | 9.3% |
| April | 11089 | 9.3% |
| June | 10939 | 9.2% |
| September | 10508 | 8.8% |
| March | 9794 | 8.2% |
| February | 8068 | 6.8% |
| November | 6794 | 5.7% |
| Other values (2) | 12709 | 10.6% |
Length
| Value | Count | Frequency (%) |
| august | 13877 | |
| july | 12661 | |
| may | 11791 | |
| october | 11160 | |
| april | 11089 | |
| june | 10939 | |
| september | 10508 | |
| march | 9794 | |
| february | 8068 | |
| november | 6794 | |
| Other values (2) | 12709 |
arrivalWeek
Real number (ℝ≥0)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.16517296 |
| Minimum | 1 |
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 16 |
| median | 28 |
| Q3 | 38 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 13.60513836 |
|---|---|
| Coefficient of variation (CV) | 0.500830176 |
| Kurtosis | -0.9860771763 |
| Mean | 27.16517296 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.01001432604 |
| Sum | 3243250 |
| Variance | 185.0997897 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 3580 | 3.0% |
| 30 | 3087 | 2.6% |
| 32 | 3045 | 2.6% |
| 34 | 3040 | 2.5% |
| 18 | 2926 | 2.5% |
| 21 | 2854 | 2.4% |
| 28 | 2853 | 2.4% |
| 17 | 2805 | 2.3% |
| 20 | 2785 | 2.3% |
| 29 | 2763 | 2.3% |
| Other values (43) | 89652 |
| Value | Count | Frequency (%) |
| 1 | 1047 | |
| 2 | 1218 | |
| 3 | 1319 | |
| 4 | 1487 | |
| 5 | 1387 | |
| 6 | 1508 | |
| 7 | 2109 | |
| 8 | 2216 | |
| 9 | 2117 | |
| 10 | 2149 |
| Value | Count | Frequency (%) |
| 53 | 1816 | |
| 52 | 1195 | |
| 51 | 933 | |
| 50 | 1505 | |
| 49 | 1782 | |
| 48 | 1504 | |
| 47 | 1685 | |
| 46 | 1574 | |
| 45 | 1941 | |
| 44 | 2272 |
arrivalDay
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.79824106 |
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.780829471 |
|---|---|
| Coefficient of variation (CV) | 0.5558105765 |
| Kurtosis | -1.187168319 |
| Mean | 15.79824106 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.002000453979 |
| Sum | 1886152 |
| Variance | 77.10296619 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 4406 | 3.7% |
| 5 | 4317 | 3.6% |
| 15 | 4196 | 3.5% |
| 25 | 4160 | 3.5% |
| 26 | 4147 | 3.5% |
| 9 | 4096 | 3.4% |
| 12 | 4087 | 3.4% |
| 16 | 4078 | 3.4% |
| 2 | 4055 | 3.4% |
| 19 | 4052 | 3.4% |
| Other values (21) | 77796 |
| Value | Count | Frequency (%) |
| 1 | 3626 | |
| 2 | 4055 | |
| 3 | 3855 | |
| 4 | 3763 | |
| 5 | 4317 | |
| 6 | 3833 | |
| 7 | 3665 | |
| 8 | 3921 | |
| 9 | 4096 | |
| 10 | 3575 |
| Value | Count | Frequency (%) |
| 31 | 2208 | |
| 30 | 3853 | |
| 29 | 3580 | |
| 28 | 3946 | |
| 27 | 3802 | |
| 26 | 4147 | |
| 25 | 4160 | |
| 24 | 3993 | |
| 23 | 3616 | |
| 22 | 3596 |
numberWeekendnights
Real number (ℝ≥0)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9275986264 |
| Minimum | 0 |
| Maximum | 19 |
| Zeros | 51998 |
| Zeros (%) | 43.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.9986134946 |
|---|---|
| Coefficient of variation (CV) | 1.076557755 |
| Kurtosis | 7.174066064 |
| Mean | 0.9275986264 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.38004645 |
| Sum | 110746 |
| Variance | 0.9972289116 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 51998 | 43.6% |
| 2 | 33308 | 27.9% |
| 1 | 30626 | 25.7% |
| 4 | 1855 | 1.6% |
| 3 | 1259 | 1.1% |
| 6 | 153 | 0.1% |
| 5 | 79 | 0.1% |
| 8 | 60 | 0.1% |
| 7 | 19 | < 0.1% |
| 9 | 11 | < 0.1% |
| Other values (7) | 22 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 51998 | 43.6% |
| 1 | 30626 | 25.7% |
| 2 | 33308 | 27.9% |
| 3 | 1259 | 1.1% |
| 4 | 1855 | 1.6% |
| 5 | 79 | 0.1% |
| 6 | 153 | 0.1% |
| 7 | 19 | < 0.1% |
| 8 | 60 | 0.1% |
| 9 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 19 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 16 | 3 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 5 | < 0.1% |
| 10 | 7 | < 0.1% |
| 9 | 11 | < 0.1% |
| 8 | 60 | |
| 7 | 19 | < 0.1% |
numberNights
Real number (ℝ≥0)
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.500301533 |
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 7645 |
| Zeros (%) | 6.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.908285615 |
|---|---|
| Coefficient of variation (CV) | 0.7632221914 |
| Kurtosis | 24.28455482 |
| Mean | 2.500301533 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.862249242 |
| Sum | 298511 |
| Variance | 3.641553989 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 33684 | 28.2% |
| 1 | 30310 | 25.4% |
| 3 | 22258 | 18.6% |
| 5 | 11077 | 9.3% |
| 4 | 9563 | 8.0% |
| 0 | 7645 | 6.4% |
| 6 | 1499 | 1.3% |
| 10 | 1036 | 0.9% |
| 7 | 1029 | 0.9% |
| 8 | 656 | 0.5% |
| Other values (25) | 633 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 7645 | 6.4% |
| 1 | 30310 | 25.4% |
| 2 | 33684 | 28.2% |
| 3 | 22258 | 18.6% |
| 4 | 9563 | 8.0% |
| 5 | 11077 | 9.3% |
| 6 | 1499 | 1.3% |
| 7 | 1029 | 0.9% |
| 8 | 656 | 0.5% |
| 9 | 231 | 0.2% |
| Value | Count | Frequency (%) |
| 50 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 35 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 5 | |
| 26 | 1 | < 0.1% |
adults
Real number (ℝ≥0)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.856403384 |
| Minimum | 0 |
| Maximum | 55 |
| Zeros | 403 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5792609988 |
|---|---|
| Coefficient of variation (CV) | 0.3120340137 |
| Kurtosis | 1352.115116 |
| Mean | 1.856403384 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.31780476 |
| Sum | 221636 |
| Variance | 0.3355433048 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 89680 | 75.1% |
| 1 | 23027 | 19.3% |
| 3 | 6202 | 5.2% |
| 0 | 403 | 0.3% |
| 4 | 62 | 0.1% |
| 26 | 5 | < 0.1% |
| 5 | 2 | < 0.1% |
| 20 | 2 | < 0.1% |
| 27 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 403 | 0.3% |
| 1 | 23027 | 19.3% |
| 2 | 89680 | 75.1% |
| 3 | 6202 | 5.2% |
| 4 | 62 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 26 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 27 | 2 | < 0.1% |
| 26 | 5 | < 0.1% |
| 20 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 62 |
chidren
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 932.9 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.000008376 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 110796 | 92.8% |
| 1.0 | 4861 | 4.1% |
| 2.0 | 3652 | 3.1% |
| 3.0 | 76 | 0.1% |
| 10.0 | 1 | < 0.1% |
| (Missing) | 4 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 110796 | |
| 1.0 | 4861 | 4.1% |
| 2.0 | 3652 | 3.1% |
| 3.0 | 76 | 0.1% |
| 10.0 | 1 | < 0.1% |
country
Categorical
| Distinct | 177 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 488 |
| Missing (%) | 0.4% |
| Memory size | 932.9 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.989243242 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 30 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PRT |
|---|---|
| 2nd row | PRT |
| 3rd row | GBR |
| 4th row | GBR |
| 5th row | GBR |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| PRT | 48590 | 40.7% |
| GBR | 12129 | 10.2% |
| FRA | 10415 | 8.7% |
| ESP | 8568 | 7.2% |
| DEU | 7287 | 6.1% |
| ITA | 3766 | 3.2% |
| IRL | 3375 | 2.8% |
| BEL | 2342 | 2.0% |
| BRA | 2224 | 1.9% |
| NLD | 2104 | 1.8% |
| Other values (167) | 18102 | 15.2% |
Length
| Value | Count | Frequency (%) |
| prt | 48590 | |
| gbr | 12129 | 10.2% |
| fra | 10415 | 8.8% |
| esp | 8568 | 7.2% |
| deu | 7287 | 6.1% |
| ita | 3766 | 3.2% |
| irl | 3375 | 2.8% |
| bel | 2342 | 2.0% |
| bra | 2224 | 1.9% |
| nld | 2104 | 1.8% |
| Other values (167) | 18102 | 15.2% |
segment
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | dir |
|---|---|
| 2nd row | dir |
| 3rd row | dir |
| 4th row | cor |
| 5th row | onl |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| onl | 56477 | 47.3% |
| off | 24219 | 20.3% |
| gro | 19811 | 16.6% |
| dir | 12606 | 10.6% |
| cor | 5295 | 4.4% |
| com | 743 | 0.6% |
| avi | 237 | 0.2% |
| und | 2 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| onl | 56477 | |
| off | 24219 | |
| gro | 19811 | 16.6% |
| dir | 12606 | 10.6% |
| cor | 5295 | 4.4% |
| com | 743 | 0.6% |
| avi | 237 | 0.2% |
| und | 2 | < 0.1% |
repeatFlag
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 115580 | 96.8% |
| 1 | 3810 | 3.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 115580 | |
| 1 | 3810 | 3.2% |
historicCancellations
Real number (ℝ≥0)
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.08711784907 |
| Minimum | 0 |
| Maximum | 26 |
| Zeros | 112906 |
| Zeros (%) | 94.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8443363842 |
|---|---|
| Coefficient of variation (CV) | 9.691887405 |
| Kurtosis | 674.0736926 |
| Mean | 0.08711784907 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.45804872 |
| Sum | 10401 |
| Variance | 0.7129039296 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 112906 | 94.6% |
| 1 | 6051 | 5.1% |
| 2 | 116 | 0.1% |
| 3 | 65 | 0.1% |
| 24 | 48 | < 0.1% |
| 11 | 35 | < 0.1% |
| 4 | 31 | < 0.1% |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 6 | 22 | < 0.1% |
| Other values (5) | 65 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 112906 | 94.6% |
| 1 | 6051 | 5.1% |
| 2 | 116 | 0.1% |
| 3 | 65 | 0.1% |
| 4 | 31 | < 0.1% |
| 5 | 19 | < 0.1% |
| 6 | 22 | < 0.1% |
| 11 | 35 | < 0.1% |
| 13 | 12 | < 0.1% |
| 14 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 26 | 26 | |
| 25 | 25 | |
| 24 | 48 | |
| 21 | 1 | < 0.1% |
| 19 | 19 | < 0.1% |
| 14 | 14 | < 0.1% |
| 13 | 12 | < 0.1% |
| 11 | 35 | |
| 6 | 22 | |
| 5 | 19 | < 0.1% |
historicBookings
Real number (ℝ≥0)
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1370969093 |
| Minimum | 0 |
| Maximum | 72 |
| Zeros | 115770 |
| Zeros (%) | 97.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.497436848 |
|---|---|
| Coefficient of variation (CV) | 10.92246977 |
| Kurtosis | 767.2452097 |
| Mean | 0.1370969093 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.53979995 |
| Sum | 16368 |
| Variance | 2.242317113 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 115770 | 97.0% |
| 1 | 1542 | 1.3% |
| 2 | 580 | 0.5% |
| 3 | 333 | 0.3% |
| 4 | 229 | 0.2% |
| 5 | 181 | 0.2% |
| 6 | 115 | 0.1% |
| 7 | 88 | 0.1% |
| 8 | 70 | 0.1% |
| 9 | 60 | 0.1% |
| Other values (63) | 422 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 115770 | 97.0% |
| 1 | 1542 | 1.3% |
| 2 | 580 | 0.5% |
| 3 | 333 | 0.3% |
| 4 | 229 | 0.2% |
| 5 | 181 | 0.2% |
| 6 | 115 | 0.1% |
| 7 | 88 | 0.1% |
| 8 | 70 | 0.1% |
| 9 | 60 | 0.1% |
| Value | Count | Frequency (%) |
| 72 | 1 | |
| 71 | 1 | |
| 70 | 1 | |
| 69 | 1 | |
| 68 | 1 | |
| 67 | 1 | |
| 66 | 1 | |
| 65 | 1 | |
| 64 | 1 | |
| 63 | 1 |
roomType
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| A | 85994 | 72.0% |
| D | 19201 | 16.1% |
| E | 6535 | 5.5% |
| F | 2897 | 2.4% |
| G | 2094 | 1.8% |
| B | 1118 | 0.9% |
| C | 932 | 0.8% |
| H | 601 | 0.5% |
| P | 12 | < 0.1% |
| L | 6 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| a | 85994 | |
| d | 19201 | 16.1% |
| e | 6535 | 5.5% |
| f | 2897 | 2.4% |
| g | 2094 | 1.8% |
| b | 1118 | 0.9% |
| c | 932 | 0.8% |
| h | 601 | 0.5% |
| p | 12 | < 0.1% |
| l | 6 | < 0.1% |
assignedType
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| A | 74053 | 62.0% |
| D | 25322 | 21.2% |
| E | 7806 | 6.5% |
| F | 3751 | 3.1% |
| G | 2553 | 2.1% |
| C | 2375 | 2.0% |
| B | 2163 | 1.8% |
| H | 712 | 0.6% |
| I | 363 | 0.3% |
| K | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| a | 74053 | |
| d | 25322 | 21.2% |
| e | 7806 | 6.5% |
| f | 3751 | 3.1% |
| g | 2553 | 2.1% |
| c | 2375 | 2.0% |
| b | 2163 | 1.8% |
| h | 712 | 0.6% |
| i | 363 | 0.3% |
| k | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
changesFlag
Real number (ℝ≥0)
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2211240472 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 101314 |
| Zeros (%) | 84.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6523055727 |
|---|---|
| Coefficient of variation (CV) | 2.949953118 |
| Kurtosis | 79.39360467 |
| Mean | 0.2211240472 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.000270054 |
| Sum | 26400 |
| Variance | 0.4255025601 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 101314 | 84.9% |
| 1 | 12701 | 10.6% |
| 2 | 3805 | 3.2% |
| 3 | 927 | 0.8% |
| 4 | 376 | 0.3% |
| 5 | 118 | 0.1% |
| 6 | 63 | 0.1% |
| 7 | 31 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 8 | < 0.1% |
| Other values (11) | 30 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 101314 | 84.9% |
| 1 | 12701 | 10.6% |
| 2 | 3805 | 3.2% |
| 3 | 927 | 0.8% |
| 4 | 376 | 0.3% |
| 5 | 118 | 0.1% |
| 6 | 63 | 0.1% |
| 7 | 31 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 3 | |
| 14 | 5 | |
| 13 | 5 | |
| 12 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
deposit
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Deposit |
|---|---|
| 2nd row | No Deposit |
| 3rd row | No Deposit |
| 4th row | No Deposit |
| 5th row | No Deposit |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| No Deposit | 104641 | 87.6% |
| Non Refund | 14587 | 12.2% |
| Refundable | 162 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| no | 104641 | |
| deposit | 104641 | |
| non | 14587 | 6.1% |
| refund | 14587 | 6.1% |
| refundable | 162 | 0.1% |
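The lowercase table above appears to count word tokens rather than whole categories, which would explain why "Non Refund" shows 12.2% under Common Values but its two words show 6.1% each here. The sketch below reproduces those numbers under the assumption that each value is lowercased and split on whitespace, and that percentages are taken over all word tokens. The tokenisation rule is an assumption, not taken from the report.

```python
from collections import Counter

# Category counts copied from the Common Values table above.
category_counts = {"No Deposit": 104641, "Non Refund": 14587, "Refundable": 162}

word_counts = Counter()
for category, count in category_counts.items():
    # Assumed tokenisation: lowercase, then split on whitespace.
    for word in category.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word:<12}{count:>8}{100 * count / total_words:>7.1f}%")
```

Under this reading, two-word categories contribute two tokens per row, so the denominator is larger than the number of observations.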
waitingDays
Numeric
| Distinct | 128 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.321149175 |
| Minimum | 0 |
|---|---|
| Maximum | 391 |
| Zeros | 115692 |
| Zeros (%) | 96.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 391 |
| Range | 391 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17.59472088 |
|---|---|
| Coefficient of variation (CV) | 7.580176694 |
| Kurtosis | 186.7930696 |
| Mean | 2.321149175 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.94435345 |
| Sum | 277122 |
| Variance | 309.5742028 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 115692 | 96.9% |
| 39 | 227 | 0.2% |
| 58 | 164 | 0.1% |
| 44 | 141 | 0.1% |
| 31 | 127 | 0.1% |
| 35 | 96 | 0.1% |
| 46 | 94 | 0.1% |
| 69 | 89 | 0.1% |
| 63 | 83 | 0.1% |
| 87 | 80 | 0.1% |
| Other values (118) | 2597 | 2.2% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 115692 | 96.9% |
| 1 | 12 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 59 | < 0.1% |
| 4 | 25 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 16 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 391 | 45 | < 0.1% |
| 379 | 15 | < 0.1% |
| 330 | 15 | < 0.1% |
| 259 | 10 | < 0.1% |
| 236 | 35 | < 0.1% |
| 224 | 10 | < 0.1% |
| 223 | 61 | 0.1% |
| 215 | 21 | < 0.1% |
| 207 | 15 | < 0.1% |
| 193 | 1 | < 0.1% |
customerSegment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 932.9 KiB |
| T | 114737 |
|---|---|
| C | 4076 |
| G | 577 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | T |
|---|---|
| 2nd row | T |
| 3rd row | T |
| 4th row | T |
| 5th row | T |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| T | 114737 | 96.1% |
| C | 4076 | 3.4% |
| G | 577 | 0.5% |
| Value | Count | Frequency (%) |
|---|---|---|
| t | 114737 | 96.1% |
| c | 4076 | 3.4% |
| g | 577 | 0.5% |
numberofRequests
Numeric
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5713627607 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 70318 |
| Zeros (%) | 58.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 932.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7927984228 |
|---|---|
| Coefficient of variation (CV) | 1.387557043 |
| Kurtosis | 1.492564811 |
| Mean | 0.5713627607 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.349189377 |
| Sum | 68215 |
| Variance | 0.6285293392 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 70318 | 58.9% |
| 1 | 33226 | 27.8% |
| 2 | 12969 | 10.9% |
| 3 | 2497 | 2.1% |
| 4 | 340 | 0.3% |
| 5 | 40 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 70318 | 58.9% |
| 1 | 33226 | 27.8% |
| 2 | 12969 | 10.9% |
| 3 | 2497 | 2.1% |
| 4 | 340 | 0.3% |
| 5 | 40 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 40 | < 0.1% |
| 4 | 340 | 0.3% |
| 3 | 2497 | 2.1% |
| 2 | 12969 | 10.9% |
| 1 | 33226 | 27.8% |
| 0 | 70318 | 58.9% |
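
The descriptive statistics reported for this variable can be cross-checked against its value counts, since all six distinct values are listed. The following is a minimal sketch in plain Python. The variable names are illustrative, and the use of the sample variance (dividing by n - 1) is an assumption about how the report computes "Variance", chosen because it reproduces the reported figure.

```python
from math import sqrt

# Value counts copied from the frequency table above (value -> count).
counts = {0: 70318, 1: 33226, 2: 12969, 3: 2497, 4: 340, 5: 40}

n = sum(counts.values())                       # number of observations
total = sum(v * c for v, c in counts.items())  # corresponds to the "Sum" row
mean = total / n

# Sample variance (divides by n - 1). This is an assumption about the
# report's definition, not something the report states.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = sqrt(variance)
cv = std / mean  # coefficient of variation = standard deviation / mean

print(n, total)  # 119390 68215
print(round(mean, 6), round(variance, 6), round(std, 6), round(cv, 6))
```

The same approach does not work directly for variables whose table ends in an "Other values" row, because the individual values behind that row are not listed.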
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
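
The calculation described above can be sketched in plain Python as follows. This is an illustrative sketch rather than the report's own implementation: the function names are hypothetical, and ties are assumed to receive the average of their ranks.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks (assumption)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x ** 3.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

For this cubic example ρ is 1 up to floating-point rounding, while Pearson's r stays below 1, which illustrates the difference between monotonic and linear correlation.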
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a perfectly linear relationship the slope of the line does not change the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
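
As a small illustration of the formula and of the invariance property, the sketch below computes r in plain Python and checks that shifting and rescaling either variable (by a positive factor) leaves it unchanged. The function name and the sample data are illustrative assumptions, not taken from the report.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / (sx * sy)

x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = pearson_r(x, y)

# Change location and scale of each variable separately.
r_shifted = pearson_r([2 * a + 3 for a in x], [5 * b - 1 for b in y])

# Two perfectly linear relationships with very different slopes.
r_shallow = pearson_r(x, [0.1 * a for a in x])
r_steep = pearson_r(x, [10 * a for a in x])

print(r, r_shifted, r_shallow, r_steep)
```

Here `r` and `r_shifted` agree up to floating-point rounding, and both linear cases give a value of 1 regardless of slope.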
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
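
The pair-counting definition above can be sketched directly in plain Python. This follows the formula exactly as stated (often called tau-a) and applies no correction for ties, so statistical libraries that use a tie-corrected variant may return different values on tied data. The function name is illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The product is positive when both variables order the pair the same way.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
        # s == 0 means a tie in x or y; the pair counts as neither.
    return (concordant - discordant) / len(pairs)

# Six pairs in total: five concordant, one discordant (the swapped 3 and 2).
print(kendall_tau([1, 2, 3, 4], [1, 3, 2, 4]))  # (5 - 1) / 6
```

This direct version examines every pair, so its cost grows quadratically with the number of observations.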
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available.
First rows
| | Unnamed: 0 | type | canceledFlag | time2Checkin | arrivalMonth | arrivalWeek | arrivalDay | numberWeekendnights | numberNights | adults | chidren | country | segment | repeatFlag | historicCancellations | historicBookings | roomType | assignedType | changesFlag | deposit | waitingDays | customerSegment | numberofRequests |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | R | 0 | 342 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | PRT | dir | 0 | 0 | 0 | C | C | 3 | No Deposit | 0 | T | 0 |
| 1 | 1 | R | 0 | 737 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | PRT | dir | 0 | 0 | 0 | C | C | 4 | No Deposit | 0 | T | 0 |
| 2 | 2 | R | 0 | 7 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | GBR | dir | 0 | 0 | 0 | A | C | 0 | No Deposit | 0 | T | 0 |
| 3 | 3 | R | 0 | 13 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | GBR | cor | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 0 |
| 4 | 4 | R | 0 | 14 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | GBR | onl | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 1 |
| 5 | 5 | R | 0 | 14 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | GBR | onl | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 1 |
| 6 | 6 | R | 0 | 0 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | PRT | dir | 0 | 0 | 0 | C | C | 0 | No Deposit | 0 | T | 0 |
| 7 | 7 | R | 0 | 9 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | PRT | dir | 0 | 0 | 0 | C | C | 0 | No Deposit | 0 | T | 1 |
| 8 | 8 | R | 1 | 85 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | PRT | onl | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 1 |
| 9 | 9 | R | 1 | 75 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | PRT | off | 0 | 0 | 0 | D | D | 0 | No Deposit | 0 | T | 0 |
Last rows
| | Unnamed: 0 | type | canceledFlag | time2Checkin | arrivalMonth | arrivalWeek | arrivalDay | numberWeekendnights | numberNights | adults | chidren | country | segment | repeatFlag | historicCancellations | historicBookings | roomType | assignedType | changesFlag | deposit | waitingDays | customerSegment | numberofRequests |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 119380 | 119380 | C | 0 | 44 | August | 35 | 31 | 1 | 3 | 2 | 0.0 | DEU | onl | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 1 |
| 119381 | 119381 | C | 0 | 188 | August | 35 | 31 | 2 | 3 | 2 | 0.0 | DEU | dir | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 0 |
| 119382 | 119382 | C | 0 | 135 | August | 35 | 30 | 2 | 4 | 3 | 0.0 | JPN | onl | 0 | 0 | 0 | G | G | 0 | No Deposit | 0 | T | 0 |
| 119383 | 119383 | C | 0 | 164 | August | 35 | 31 | 2 | 4 | 2 | 0.0 | DEU | off | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 0 |
| 119384 | 119384 | C | 0 | 21 | August | 35 | 30 | 2 | 5 | 2 | 0.0 | BEL | off | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 2 |
| 119385 | 119385 | C | 0 | 23 | August | 35 | 30 | 2 | 5 | 2 | 0.0 | BEL | off | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 0 |
| 119386 | 119386 | C | 0 | 102 | August | 35 | 31 | 2 | 5 | 3 | 0.0 | FRA | onl | 0 | 0 | 0 | E | E | 0 | No Deposit | 0 | T | 2 |
| 119387 | 119387 | C | 0 | 34 | August | 35 | 31 | 2 | 5 | 2 | 0.0 | DEU | onl | 0 | 0 | 0 | D | D | 0 | No Deposit | 0 | T | 4 |
| 119388 | 119388 | C | 0 | 109 | August | 35 | 31 | 2 | 5 | 2 | 0.0 | GBR | onl | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 0 |
| 119389 | 119389 | C | 0 | 205 | August | 35 | 29 | 2 | 7 | 2 | 0.0 | DEU | onl | 0 | 0 | 0 | A | A | 0 | No Deposit | 0 | T | 2 |